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Summary 

Bacteria from the Bacteroidetes phylum are known 
producers of the chemotaxonomic relevant flexiru- 
bins. These orange pigments comprise a non- 
isoprenoid aryl-polyene carboxylic acid esterified 
with a diall<ylresorcinol. Herein, we report a gene 
cluster from Chitinophaga pinensis encoding the 
biosynthesis of the polyene moiety and the biochemi- 
cal characterization of a tyrosine ammonia-lyase and 
a 4-coumarate-CoA ligase responsible for the initia- 
tion of the polyene biosynthesis. Additionally, the 
flexirubin of C. pinensis was characterized by a com- 
bination of feeding experiments, high-performance 
liquid chromatography tandem mass spectrometry 
and matrix-assisted laser desorption/ionization mass 
spectrometry. 

Introduction 

Since the discovery of flexirubin (1) (Fig. 1) in 
Chiitinophiaga fiiiformis (previously Flexibacter elegans Fx 
e1) (Reichenbach etal., 1974; Kampfer etal., 2006), 
flexirubin-type pigments have been used as 
chemotaxonomic markers for the bacteria of the 
Bacteroidetes phylum (previously called the Cytophaga- 
Flavobacterium-Bacteroides group). Their conserved 
structural feature is a (o-(4-hydroxyphenyl)-polyene 
carboxylic acid chromophore, esterified with a 2,5- 
dialkylresorcinol (DAR). Known derivatives differ in 
methylation and/or chlorination pattern at the polyene ring 
and the chain lengths of the DAR alkyi chains as well as 
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the polyene chain length (Achenbach etai., 1982). 
Feeding experiments with radioactive precursors com- 
bined with chemical degradation showed that tyrosine is 
the precursor of the polyene ring and that its methyl group 
is derived from methionine (Achenbach etai., 1979). As 
labelled acetate and malonate were also incorporated, 
it was proposed that fatty acid and/or polyketide 
biosynthesis mechanisms are involved (Fautz and 
Reichenbach, 1979). Whereas we could show previously 
that the DAR moiety is derived from a condensation of two 
fatty acid metabolism intermediates by DAR enzymes 
(Fuchs etai., 2013), the enzymes for the biosynthesis of 
the polyene moiety are still unknown. However, mutant 
strains of Flavobacterium joiinsoniae UW101 revealed 
that the DAR biosynthesis genes from this flexirubin pro- 
ducer are flanked by other genes encoding proteins 
involved in flexirubin biosynthesis (McBride etai., 2009). 
One gene in this gene cluster encodes a putative enyzme 
from the aromatic amino acid lyase family. This enzyme 
family comprises histidine ammonia-lyases (HALs), 
phenylalanine ammonia-lyases (PALs) and tyrosine 
ammonia-lyases (TALs). HALs are common in bacteria 
and humans and deaminate histidine to urocanic acid 
(IVIichal, 1 999). PALs convert phenylalanine to E-cinnamic 
acid (CA), whereas TALs deaminate tyrosine to 
4-coumarate (4-hydroxycinnamic acid, 4CA). Whereas 
PAL and TAL are ubiquitous in plants and fungi, only a few 
examples were found in bacteria, such as Streptomyces 
maritimus PAL (Xiang and Moore, 2002), Piiotoriiabdus 
luminescens PAL (Williams etai., 2005) Rliodobacter 
capsuiatus TAL (Kyndt etai., 2002), Rtiodobacter 
spiiaeroides TAL (Xue etai., 2007), Streptomyces sp. 
Tu4128 TAL (Zhu etai., 2012) and Sacctiarotiirix 
espanaensis TAL (Berner etai., 2006). 4CA or CA can 
be activated by 4-coumarate-CoA ligases (4CL) or 
E-cinnamate-CoA ligases (CCL). 4CL are common in 
plants where they catalyse the last reaction in the 
phenylpropanoid pathway and have been extensively 
studied in the past. Only little is known about enzymes 
with 4GL activity from bacteria; as to the best of our 
knowledge, only the enzyme ScCCL from Streptomyces 
coeiicoior A3 (2) (Kaneko etai., 2003) and an unknown 
enzyme from Pseudomonas putida (Zenk etai., 1980) 
have been shown in vitro to have 4CL activity. Further- 
more, hints for the presence of 4GL in bacterial 
biosynthetic pathways were gained in works about the 
biosynthesis of enterocin in S. maritimus (Hertweck and 
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Fig. 1. Structure of flexirubin (1) from Chitinophaga filiformis and 
Chitinophaga pinensis. 

Moore, 2000) or the photoactive yellow protein in 
R. capsulatus (Kyndt etal., 2003). 

Here, we describe the putative flexirubin biosynthesis 
gene cluster of Chitinophaga pinensis and the biochemi- 
cal characterization of the TAL as well as the 4CL, both 
involved in the start of the polyene biosynthesis. Further- 
more, we characterized the flexirubin from C. pinensis 
by a combination of stable isotope labelling, high- 
performance liquid chromatography-electrospray ioniza- 
tion tandem mass spectrometry (HPLC-ESI-MS) and 
matrix-assisted laser desorption/ionization mass spec- 
trometry (IVIALDI-MS), presenting an alternative method 
for the detection of this pigment class in bacteria. 

Results and discussion 

We previously reported that heterologous expression of 
the genes darB (Cpin_6850) and darA (Cpin_6851) 
(Fig. 2A) from C. pinensis led to the production of a DAR, 
which is identical to the DAR moiety in flexirubin 1 from 



C. filiformes (Fuchs etal., 2013). Additionally, these 
genes have also been found in F. johnsoniae UW101 
where mutation of c/arS(Fjoh_1102) (Fig. 2B) resulted in a 
flexirubin-negative phenotype (McBride etal., 2009). In 
F. johnsoniae, darA and darB are part of a 37 kbp gene 
cluster and a spontaneous flexirubin-negative mutant was 
complemented by a plasmid carrying part of this cluster 
(Fjoh_1078 to Fjoh_1089) (Fig. 2B), which proved that 
additional genes from this cluster are involved in flexirubin 
biosynthesis (McBride etal., 2009). However, no further 
characterization of this gene cluster was reported. 
STRING analysis (Franceschini etal., 2013) of the 
C. pinensis genome with the F. yo/7nson/ae flexirubin gene 
cluster revealed a homologous gene cluster spanning 
Cpin_1853 (fIxA) to Cpin_1877 (fIxY) and the already 
reported DAR gene cluster (Fig. 2A and Tables S4 and 
S5). In the gene cluster, several putative (3-ketoacyl 
synthases, reductases, dehydratases and thioesterases 
are encoded, suggesting a type II fatty acid synthase-like 
biosynthesis of the polyene moiety (Fig. 3). As the 
polyene-aryl and the adjacent double bond are derived 
from tyrosine (Achenbach etal., 1979), we speculated 
that the predicted HAL FIxA (Cpin_1853) encoded in the 
biosynthesis gene cluster, in fact, catalyses the deamina- 
tion of L-tyrosine to 4CA, which might then be activated for 
the polyketide synthase (PKS) machinery by adenylation 
through the putative acyl-CoA ligase FIxY (Cpin_1877) 
(Fig. 3). 4-Coumaroyl-CoA may then be used as precursor 
for a fatty acid-like biosynthesis of the polyene by chain 
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Fig. 2. Known and proposed gene clusters for the biosynthesis of flexirubins in Chitinophaga pinensis DSM 2588 (A) and Flavobacterium 
johnsoniae UW101 (B). Genes encoding proteins FIxA, FIxY, DarA and DarB are shown with black frames. The blue arrow shows the region 
that complemented a spontaneous flexirubin-negative strain of F. johnsoniae (McBride etal., 2009). Colours depict genes with the same anno- 
tation, which are connected by grey lines if their identity was > 40% in a BLAST-P analysis. All genes are scaled to the depicted scale bar. 
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Fig. 3. Postulated biosynthesis of flexirubin in Chitinophaga 
pinensis. Dashed frames mark reactions previously shown or 
reported in this work. AT: acyltransferase; KS: ketosynthase; 
ACP: acyl carrier protein; KR: ketoreductase; DH: dehydratase; 
MCoA: malonyl-CoA; TE: thioesterase; MT: methyltransferase. 



elongation through the putative |3-l<etoacyl synthases 
(FIxC, Fix!, FIxL, FIxN, FIxO) followed by reduction of the 
P-keto function by FIxB or FIxV and generation of a double 
bond by a dehydratase (FIxF, FIxS). The aryl-octaene 
moiety may then be connected to the DAR by a ligase- 
like enzyme (FIxW). Flexirubin is found in the outer mem- 
brane (Irschik and Reichenbach, 1978). Thus, fIxP-flxU, 
encoding a putative polysaccharide deacetylase, a 
phospholipid/glycerol acyltransferase, an outer mem- 
brane lipoprotein carrier LolA, a glycosyltransferase and a 
predicted exporter may be involved in the export of the 
pigment by yet unknown mechanisms. 

Tyrosine ammonia-lyase activity 

Because two motives were identified in the past that might 
allow the prediction of the substrate specificity of aromatic 
amino acid ammonia-lyases, an alignment of the FIxA 
primary sequence with those of eight ammonia-lyases 
with known substrate specificity was performed (Fig. S1). 
However, the amino acids at motive 1 (Watts eta!., 2006) 
were similar to those of HAL enzymes and not TAL/PAL, 
whereas at motive 2 (Berner eta!., 2006), the conserved 
residue from TAL or PAL enzymes was found. To get 
experimental evidence whether FIxA has ammonia-lyase 
activity, it was heterologously produced in Esciiericiiia 
coli. Subsequent purification of FIxA (Fig. S2) allowed its 
incubation with putative substrates and the enzyme 
assays were analysed by gas chromatography-mass 
spectrometry (GC-MS). In assays containing FIxA and 
L-tyrosine, 4CA was detected (Fig. 4A and Fig. S3A), 
whereas L-phenylalanine was deaminated to CA 
(Fig. S3B). No product formation was detectable with 
L-histidine, L-isoleucine and L-tryptophane as substrates 
(data not shown). The same results were obtained in a 
photometric assay (Fig. S4A). To confirm the specificity 
of FIxA, photometric assays with L-tyrosine or L- 
phenylalanine as substrate were optimized towards pH 
and temperature (Fig. S4B). The linear plots of the 
Michaelis-Menten kinetics (Fig. S5A-D) revealed /(cat/KM 
values of 7728 M"^ s"'' for L-tyrosine and 7.7 M"'' s"^ for 
L-phenylalanine. Although other bacterial TALs showed a 
150- to 300-fold higher catalytic activity with L-tyrosine 
(Watts etai., 2006), FIxA showed a 1010-fold higher 
specificity towards L-tyrosine against L-phenylalanine, 
supporting its proposed function in the flexirubin 
biosynthesis. 

CoA iigase activity 

As the polyene-aryl and the adjacent double bond from 
flexirubin are derived from tyrosine (Achenbach etai., 
1979), we concluded that the 4CA produced by FIxA may 
be activated by the putative acyl-CoA Iigase FIxY to serve 
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Fig. 4. Analysis of assays with purified 
enzymes from the flexirubin 
biosynthesis. 

A. GC-MS analysis (TIC, positive mode) 
of L-tyrosine containing assays with (I) 
or without FIxA (II) and of 4-coumaric 
acid standard (III). 

B. HPLC-MS analysis of assays 
containing 4-coumaric acid (I) or 
£-cinnamic acid (II) with (solid line) or 
without (dashed line) FIxY and of 
4-coumaroyl-CoA standard (III). 
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as substrate for the following PKS biosynthesis of the 
polyene. An alignment of the primary sequence from FIxY 
with those from ScCCL from S. coelicolor and three 4CL 
isoforms from Arabidopsis thaliana is shown in Fig. S6. 
The alignment shows that, in contrast to ScCCL (44% 
identity to At4CI2), FIxY is not closely related to plant 4CLs 
(< 12% identity to At4CL1-3) and several residues in 
reported conserved domains differ from those of plant 
4CLs (Schneider etal., 2003) or ScCCL (Kaneko etal., 
2003), preventing an accurate prediction of the catalysed 
reaction. Therefore, we tested the acyl-CoA ligase activity 
of FIxY in vitro. After heterologous overexpression in 
E. CO// and subsequent purification (Fig. S2), its ability to 
perform the postulated reaction was tested In enzyme 
assays. HPLC-ESI-MS analysis of the enzyme assay with 
4CA as substrate revealed the formation of a compound 



with m/z 914 [M + H]+ (Fig. 4BI and Fig. S7AIII) and the 
same retention time and MS^ as 4-coumaroyl-CoA 
(Fig. 4BIII and Fig. S7AII), whereas In the presence of 
CA, a compound with m/z 898 [M + H]+ was formed 
that fragmented In MS^ experiments very similar to 
4-coumaroyl-CoA but with -A16 Da mass shifts as 
expected for £-clnnamoyl-CoA (Fig. 4BII and Fig. S7AIV). 
Furthermore, £-3-indoleacrylic acid, lndole-3-propionic 
acid, 3-chlorocinnamic acid and 3-phenylproplonic acid 
were also accepted as substrates, but other carboxylic 
acids were not (Fig. S8 and Table S3). To test the sub- 
strate specificity of FIxY, a coupled enzyme assay was 
used and optimized for product formation at different pH 
and temperatures (Fig. S9A and B). The Mlchaells- 
Menten kinetics with 4CA and CA were determined and 
the linear plots (Fig. S9C-F) showed /Ccsi/Km values of 
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Fig. 5. Mass spectra of flexirubin 1 from C. pinensis. 

A. IVIALDI-orbitrap mass spectra of wild-type 1 (I) and feeding experiments with ds-methionine (II) and d4-tyrosine (III) are shown. 

B. MALDI-iontrap-MS= of m/z 634.4 [M]+ . MALDI-iontrap-MS^ of m/z 293.2 (C) and m/z 343.3 (D) with m/z 634.4 [M]+- as precursor. 



52977 M"^ s"^ for 4CA and 437 M"^ s"^ for CA, giving a 
121 -fold higher catalytic activity with 4CA. Values for the 
only two other biochemical characterized bacterial 
enzymes with 4CL activity are given in Table S6 (Zenk 
eta!., 1980; Kaneko etal., 2003), showing that FIxY is the 
bacterial enzyme with the best 4CL activity reported to 
date. 

Flexirubin structure elucidation 

As C. pinensis is a close relative of C. filiformes in the 
Bacteroidetes phylum (Kampfer etal., 2006), we were 
interested whether both strains also produce the same 
flexirubin derivative. Crude acetone extracts of 
C. pinensis cultures were yellow, and a reversible colour 
shift upon treatment with alkali (Fig. SI OA) suggested 
the presence of flexirubin as described (Fautz and 
Reichenbach, 1980). Subsequent purification of the 
yellow compound and MALDI-MS analysis led to mass of 
m/z 634.40217 [Mf • and a sum formula of C43H54O4 (cal- 
culated m/z 634.401 66 [M]+-; Appm 0.8). The formation of 



a radical cation may be explained due to direct absorption 
of the laser light by the polyene instead of matrix- 
mediated ionization mechanisms. HPLC-UV-ESI-MS 
(Fig. S10B) confirmed that this compound also absorbs 
light at 420 nm, a characteristic of flexirubins. When 
C. pinensis was grown in the presence of L-[methyl- 
^Halmethionine, an additional signal with a mass increase 
of 3 Da was detected in the coloured fraction, indicating 
the incorporation of a methionine-derived methyl group 
(Fig. 5AIII). Similarly, a +3 Da mass shift was also 
observed when L-[ring-^H4]tyrosine was added to the 
culture (Fig. 5AIII), indicating its incorporation and the 
loss of one deuterium at the polyene ring, as expected for 
a methylation during the biosynthesis. Fragmentation of 
m/z 634.4 [M]+- by MALDI-MS led to four main fragments 
(Fig. 58), already reported for fragmentation of 1 with 
EI-MS (Achenbach and Kohl, 1 979), with m/z 343 and m/z 
293 reflecting an ester cleavage, m/z 315.2 as loss of CO 
(A28 Da) and m/z 265.1 as loss of benzene (A78 Da) from 
m/z 343 respectively. MS^ of m/z 293.2 resulted in a mass 
spectrum identical to the heterologously produced DAR 
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(Fuchs efa/., 2013) (Fig. 5C). The detected fragments 
can be explained by the loss of substituents at the aro- 
matic ring (Fig. 5C). IVIS^ of m/z 343.2 gave a complex 
fragmentation pattern with a key fragment of m/z 121.1 
(rearrangement to the tropylium cation is expected) and 
mass shifts of 26 and 28 Da (Fig. 5D), which turned out to 
be a characteristic of flexirubin polyene fragments. MS^ of 
m/z 315.2 and m/z 265.1 showed similar fragments and 
fragmentation patterns and reflect shorter polyene frag- 
ments (Fig. S10D and E respectively). When m/z 633.5 
[M - H]- was fragmented by HPLC-ESI-MS^ m/z 341 and 
m/z 291 could be detected (Fig. S10C). In addition, ana- 
lytical fragments with 39 and 65 Da mass shifts to m/z 291 
were detected. To confirm the assigned fragments, 
IVIALDI-MS^ analysis of the labelled C. pinensis pigments 
was performed. IVIS^ analysis of m/z 637 [M]+- observed in 
both labelling experiments (L-[methyl-^H3]methionine or 
L-[ring-^H4]tyrosine) showed a mass increase of 3 Da 
compared to the previously assigned polyene fragments 
m/z 343, m/z 315 and m/z 265, but no mass shift for the 
DAR fragment m/z 293 (Fig. SI OF, left side). Subsequent 
IVIS^ of the labelled polyene fragments m/z 346 from both 
feeding experiments allowed the detection of the label 
at the expected key fragment m/z 124 (Fig. S10F, right 
side). 

Therefore, from the colour of the isolated compound, 
positive KOH test result, incorporation of labelled precur- 
sors, and fragmentation in MALDI-IVIS and ESI-MS, the 
structure of 1 from C. pinensis is most likely similar to the 
reported flexirubin from C. filiformes (Achenbach eta!., 
1976). A remaining task would be to elucidate by means 
of nuclear magnetic resonance or chemical degradation if 
the methylation is indeed in ortho-position to the hydroxyl 
function of the polyene aryl, which is so far the only 
observed substitution pattern for flexirubins. As this is a 
time-consuming and challenging process for flexirubins 
(Fautz and Reichenbach, 1980), it would only be justified 
if a new derivative is expected. Nevertheless, most of 
the structure elucidation of 1 has been achieved 
without radioactive labelling and circumventing large- 
scale fermentations as it has been necessary before 
(Reichenbach etal., 1974; Achenbach and Kohl, 1978). 

Conclusions 

In summary, we showed that the genome of C. pinensis 
contains a flexirubin biosynthesis gene cluster, in which 
fIxA encodes a TAL and fIxY encodes a CoA ligase with 
specificity for 4CA. Only a few bacterial enzymes with 
PAL or TAL activity were reported before, whereas HAL 
are widely distributed. Due to the fact that, actually from 
sequence information alone, the substrate specificity of 
these enzymes cannot be predicted, the distribution of 
PAL and TAL enzymes in the Bacteroidetes phylum and 



in bacteria in general is still unknown. Even less is 
known about enzymes with 4CL activity from bacteria 
and FIxY shows a low similarity to the well-known plant 
4CL enzymes. The herein reported enzymes from 
C. pinensis may therefore become already characterized 
examples for future studies targeting the mechanisms 
that confer their substrate specificity and may help to 
develop solid predictions based upon sequence align- 
ments. Additionally, the first two steps in the polyene 
biosynthesis of flexirubins were elucidated, which have 
been unknown before. Furthermore, the identification of 
flexirubin with nowadays widespread and established 
HPLC-ESI-MS and MALDI-MS methods was reported 
and might lead to the identification of additional flexirubin 
producers. Its column chromatography is simple because 
even traces of pigment can be visually detected and less 
than 1 i^L of enriched fraction is needed for a complete 
MALDI-MS" analysis, enabling the characterization of 
flexirubins from newly described bacteria producing this 
chemotaxonomic marker. 



Experimental procedures 

Cultivation of strains 

All strains (Table SI) were cultivated on solid Luria-Bertani 
(LB) medium (5 g 1"^ of yeast extract, 10 g tryptone and 
5 g M NaCI) or in liquid LB medium on a rotary shaker at 
200 r.p.m. For feeding experiments and extraction of chromo- 
somal DNA, C. pinensis DSM 2588 was cultivated at 30°C 
and E. coli strains at 37°C. Where appropriate, kanamycin 
(50 ng ml"^) was added to the medium. 



Molecular-bioiogical metliods 

Molecular-biological experiments were performed according 
to standard procedures (Sambrook efa/., 1989). Polymerase 
chain reactions (PCRs) were performed with the Phusion 
high-fidelity polymerase (Thermo Fisher Scientific, Waltham, 
MA) according to the manufacturer's instructions and with 
oligonucleotides (Table S2) from Sigma-Aldrich (St. Louis, 
MO, USA). DNA extraction from agarose gels was performed 
with the GeneJet™ Gel Extraction Kit (Thermo Fisher Scien- 
tific) and plasmid isolations with the GeneJet Plasmid Extrac- 
tion Kit (Thermo Fisher Scientific). 



Construction of piasmids for tieterologous expression of 
FixA and FixY 

The genes cpin_1 853 (fIxA) and cpin_1 877 {fIxY) were ampli- 
fied by PGR from genomic DNA isolated from C. pinensis. The 
resulting products, 1853fragment and 1877fragment, were 
digested with the restriction endonucleases BamHI/Xhol and 
BamHI/Hindlll, respectively, and ligated into the similarly 
treated pCOLADuet-1 , followed by transformation of £. coli 
DH10B cells. The piasmids pCOLA-1853 and pCOLA-1877 
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were isolated and the inserts were sequenced at tlie SeqIT 
GmbH (Kaiserslautern, Germany). 

Overproduction and purification of FIxA and FIxY 

Esctiericliia coli BL21 (DE3) cells were transformed with 
pGOLA-1853 or pCOLA-1877, resulting in strain CS1853 and 
CS1877 respectively. Additionally, 200 ml of LB medium was 
inoculated 1:100 from an overnight culture of CS1853 or 
CS1877 and grown to a ODeoo of 0.6-0.8 at 37°C. Isopropyl- 
|3-D-1-thiogalactopyranoside was added to a final concentra- 
tion of 0.1 mM and cells were cultivated for 16 h at 18°C. After 
centrifugation, the cell pellets were frozen at -20°C. Gells 
were lysed according to the freeze-thaw method, by addition 
of a lysation buffer [500 mM NaCI, 20 mM imidazole, 20 mM 
tris(hydroxymethyl)aminomethane (Tris) (pH 7.5)], which 
additionally contained 0.005 volume of protease inhibitor 
cocktail set (Calbiochem, Merck, Darmstadt, Germany), 
lysozyme (lOkUmM) and benzonase (2.5 U mM). The 
soluble fractions were filtered through 0.6 |xM syringe filters 
and applied to a AKTAexplorer™ System (GE Healthcare, 
Fairfield, CT, USA) equipped with a HisTrap™ HP 1 ml affinity 
chromatography column. Binding and elution buffer con- 
tained 500 mM NaCI, 20 mM imidazole and 20 mM Tris 
(pH 7.5), and 500 mM NaCI, 500 mM imidazole and 20 mM 
Tris (pH 7.5) respectively. Eluted fractions were analysed 
by sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis (SDS-PAGE) and protein concentrations were 
determined with a Nanovue Plus Spectrophotometer (GE 
Healthcare). The HiSe-tagged proteins were stored at -80°C 
In storage buffer [100 mM NaCI, 50 mM Tris (pH 7.5), 1 mM 
dithiothreitol, 1 mM ethylenedlaminetetraacetic acid and 10% 
glycerol]. 

FixA in vitro assays 

Activity of FIxA with L-histidine, L-tyrosine, L-phenylalanine, 
L-isoleucine and L-tryptophan was tested in 1 ml of assays 
[1 \iM FIxA, 1 mM L-amino acid and 50 mM Tris (pH 8.0)]. 
After incubation (12 h, 25°C), assay products were extracted 
twice with 1 volume of ethylacetate, dried and redissolved in 
50 |xl of CHCI3 followed by derivatization (55°C, 2 h) with 
25 |xl of A/-methyl-A/-trimethylsilyl-trifluoroacetamide. Reac- 
tion products were detected on a 7890A gas chromatograph 
(Agilent, Santa Clara, CA, USA) equipped with a 5975C 
mass selective detector (Agilent). A DB5ht column (30 m x 
250 |im X 0.1 ^M; Agilent) was used for the separation of the 
analytes, with helium as the carrier gas (1 ml min"^). The 
method parameters were as follows: injection volume: 2 
Inlet temperature: 275°C; injection mode pulsed split (10:1); 
oven temperature programme: starting temperature 2 min at 
130°C, then 5°C min"' to 240°C followed by an increase to 
300°C with a rate of 30°C min"' or inlet temperature: 250°C; 
Injection mode splitless; oven temperature programme: start- 
ing temperature 5 min at 70°C, then 5°C min"^ to 300°C. 
Ionization of the analyte molecules was carried out by elec- 
tron impact ionization at 70 eV. For identification of the prod- 
ucts, 'Automated Mass Deconvolution and Identification 
Software' (AMDIS) version 2.64 in combination with the NIST 
library was used. 

To determine the substrate specificity of FIxA, photometric 
assays were performed in a 96-well plate using l- 



phenylalanine, L-tyrosine and L-histidine as substrates. For- 
mation of the reaction products was measured as the 
increase of the absorption at 275 nm (£-cinnamic acid), 
310 nm (4-coumaric acid) and 277 nm (urocanic acid) with a 
SpectraMax M5 Photometer (Molecular Devices, Sunnyvale, 
CA, USA). The pH and temperature optima were determined 
in 100 1^1 endpoint assays containing 1 |xM FIxA, 50 nM 
L-tyrosine or 20 mM L-phenylalanine with 50 mM Tris (pH 6- 
10) or 50 mM Na2B407 (pH 10-12) at 45°C, 40 min for pH 
optima and 50 mM Tris (pH 10) at 25-60°C, 1 h for tempera- 
ture optima. Michaelis-Menten kinetics were measured in 
photometric assays as triplicate [1 |xM FIxA, 50 mM Tris 
(pHIO), 45°C, 5 min, 5-1000 ^M L-tyrosine or 4-100 mM 
L-phenylalanine]. To calculate product concentrations, cali- 
bration curves with 4-coumaric acid (5-500 |xM) and 
£-cinnamic acid (10-2000 |xM) were obtained. The meas- 
ured data for both substrates were plotted as Michaelis- 
Menten kinetic and its derivations, Lineweaver-Burk and 
Hanes-Woolf, were used to calculate Km, Vmax, /feat and 

/(cat/ Km. 



FIxY 'in vitro assays 

Formation of E-cinnamoyl-CoAand p-coumaroyl-CoAby FIxY 
was analysed by HPLC-MS as described below. Therefore, 
enzyme assays [1 mM ATP, 1 mM CoA, 1 mM MgCl2, 50 mM 
Tris (pH 7.5), 37°C, 1 h] were performed. Depending upon the 
experiment, FIxY (1 ^M), £-cinnamic acid (1 mM) and 
4-coumaric acid (1 mM) were added. In addition, formation of 
CoA derivatives with other acids was tested. Enzyme assays 
[9.4 nM FIxY, 4.6 mM diverse acids, 1.4 mM CoA, 2.8 mM 
ATP 5 mM MgClj, 200 mM NaCI, 50 mM Tris (pH 7.5)] were 
incubated overnight at 37°C, stopped by the addition of formic 
acid (final concentration 4.6%) and analysed by HPLC-MS 
and MALDI-MS. For determination of the substrate specificity 
of FIxY, the EnzChek Pyrophosphate Assay Kit (Life Technol- 
ogies, Carlsbad, CA, USA) was used. The pH optimum was 
determined between pH 6.5 and pH 8.5 and with an endpoint 
assay (1 |iM FIxY, 1 mM 4-coumaric acid, 1 mM CoA, 1 mM 
ATP 1 mM MgClj, 50 mM Tris, 37°C, 1 min). To test the 
temperature dependence, the endpoint assay was performed 
between 25°C and 55°C at a pH of 7.5. Michaelis-Menten 
kinetics were measured as quadruplicate assays [0.5 |xM 
FIxY, 10-750 nM 4-coumaric acid or 50-7000 ^iM E-cinnamic 
acid, 1 mM CoA, 1 mM ATP, 1 mM MgCb, 50 mM Tris 
(pH 7.5), 55°C, 2 min]. Km, i/max, /(cat and /Ccat/Ky were calculated 
after plotting the data as Lineweaver-Burk and Hanes-Woolf 
plots. 4-Coumaroyl-CoA standard was obtained from 
MicroCombiChem (Wiesbaden, Germany). 



Feeding experiments 

Four hundred millllitres of liquid LB medium were inoculated 
1:100 from an overnight culture of C. pinensis. After 3 h, 
L-[methyl-^H3]methionine or L-[ring-^H4]tyrosine were fed in 
1 mM portions to the cultures. Feeding was repeated after 1 2, 
24 and 48 h, resulting in a final concentration of 4 mM. As 
control, a culture without feeding was cultivated at the same 
time. Cultures were harvested 12 h after the last feeding by 
centrifugation at 10,000 r.p.m. 
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Extraction and isolation of compounds 

Pelleted cells of liquid cultures were shaken for 30 min with 
50 mL of acetone in the dark, followed by filtration to remove 
cell debris. The extracts were dried under reduced pressure. 
The raw extracts were fractionated using silica gel columns 
with hexane/ethylacetate (4:1; 1% acetic acid). Coloured 
fractions were dried under reduced pressure. All samples 
were stored dry at -20°C in the dark and solved in 10-50 nl 
of acetonitrile (ACN) before further analysis. 

HPLC-MS 

HPLC-ESI-MS analysis was performed with a Dionex 
UltiMate 3000 system coupled to a Bruker AmaZon X mass 
spectrometer and an Acquity UPLC BEH C18 1.7 |xm RP 
column (Waters, Milford, MA, USA) as described previously 
(Reimer ef a/., 2011). 

MALDI-MS 

For MALDI-analysis, samples were mixed 1:2 with 1 ^1 
of a 20 mM 4-chloro-a-cyanocinnamic acid in 70% ACN with 
0.1% trifluoroacetic acid, spotted onto a polished stainless 
steel target and air-dried. MALDI-MS" analysis was per- 
formed with a MALDI LTQ Orbitrap XL (Thermo Fisher Sci- 
entific) equipped with a nitrogen laser at 337 nm, as 
described previously (Fuchs ef a/., 2012). Spectra were ana- 
lysed and possible sum formulas were calculated using Qual 
Browser (version 2.0.7; Thermo Fisher Scientific). 

Genome mining 

For identification of DarAB gene clusters, the primary struc- 
tures of the DarAB homologues from C. pinensis DSM 2588 
were used for a BLAST-P identification (protein-protein- 
BLAST) of homologues in F. johnsoniae UW101. The 
reported biosynthesis gene cluster from F. johnsoniae 
UW101 (McBride ef a/., 2009) was used in a STRING analy- 
sis (version 9.0) (Franceschini etal., 2013) for the identifica- 
tion of the putative flexirubin polyene cluster in C. pinensis. 
Gene colours in Fig. 2 are based upon NCBI annotation or 
domain-guided annotations from BLAST-P with primary 
sequences of C. pinensis genes as template (see Table S4 
and S5). Grey connections between genes depict an identity 
> 40%. 
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Fig. SI. ClustalW multiple alignment of primary sequences 
of FIxA from C. pinensis with those of histidine ammonia- 
lyases (HALs), phenylalanine ammonia-lyases (PALs) and 
tyrosine ammonia-lyases (TALs) with known substrate speci- 
ficity. Sequences used for the alignment were abbreviated as 
follows (with GenBank accession numbers in parentheses): 
HAL P.putida, HAL Pseudomonas putida (P21310); HAL 
S. griseus, HAL Streptomyces griseus (AAA26769); PAL 
P. crispum, PAL Petroselinum crispum (P24481); PAL 
P. luminescens, PAL Photorhabdus luminescens sp. 
laumondii TT01 (NP_929491); PAL S. maritlmus, PAL 
Streptomyces maritlmus (AAF81735); TAL C. pinensis, 
TAL Chitinophaga pinensis (YP_003121550); TAL 
R. sphaeroides, TAL Rhodobacter sphaeroides 2.4.1 
(YP_355075); TAL S. espanaensis, TAL Saccharothrix 
espanaensis (ABC88669); TAL S. sp. Tu 4128, TAL 
Streptomyces sp. Tu 4128 (AEV23249). The selectivity 
switch reported by Watts et al. (2006) is boxed in motive 1. 
The residue at the first position (here residue 140) varied 
between Phe (F) and His (H) in PAL sequences, whereas TAL 
or PAL with TAL activity have a His (H) at the first position. 
Furthermore, a second conserved residue was reported 
(Berner et al., 2006), which is boxed in motive 2 (here residue 
507) and was always Glu (E) for HAL and Gin (Q) for TAL or 
PAL. The alignment was performed using Geneious 6.1.7 
ClustalW default settings. Coloured residues indicate a simi- 
larity > 75%. 

Fig. S2. SDS-PAGE analysis of purified proteins. Left side: 
Purified FIxA (expected size 58 kDa). Right side: Purified 
FIxY (expected size 49 kDa). Marker: PageRuler™ 
Unstained protein ladder (Fermentas). 
Fig. S3. GC-MS analysis of FIxA enzyme assays. A. Mass 
spectrum of the product observed after incubation of FIxA 
with L-tyrosine (I) and 4-coumaric acid standard (II). B. 
Chromatogram of an assay containing FIxA and 
L-phenylalanine (I), the mass spectrum of the product at 
6.1 min from the above chromatogram (II) and the mass 
spectrum of E-cinnamic ac\d standard (III). 
Fig. S4. Photometric FIxA enzyme assays. A. Endpoint 
assay detecting the enzymatic conversion of L-tyrosine 
(blue), L-phenylalanine (red) and L-histidine (green) by meas- 
uring the absorption increase of 4-coumaric acid (310 nm), 
£-cinnamic acid (275 nm) or urocanic acid (277 nm) respec- 
tively. B. pH dependence of FIxA activity with L-tyrosine (I) 
and L-phenylalanine (II). Temperature dependence of FIxA 
activity with K-tyrosine (III) and L-phenylalanine (IV). TAL: 
tyrosine ammonia-lyase; PAL: phenylalanine ammonia-lyase; 
HAL: histidine ammonia-lyase. 

Fig. S5. Michaelis-Menten kinetics of FIxA. Substrate satu- 
ration plots of FIxA with L-tyrosine (A) and L-phenylalanine (B) 
and their respective linearization as Lineweaver-Burk (C, D) 
or Hanes-Woolf plots(E, D). TAL: tyrosine ammonia-lyase; 
PAL: phenylalanine ammonia-lyase. 

Fig. S6. ClustalW multiple alignment of primary sequences 
of FIxY from Chitinophaga pinensis with three 4-coumarate- 
CoA ligases (4CL) from Arabidopsis thaliana and the 
£-cinnamate-CoA ligase (CCL) ScCCL from Streptomyces 
coelicolor A3(2). Sequences used for the alignment were 
abbreviated as follows (with GenBank accession numbers 
in parentheses): At4CL1, A. thaliana 4CL1 (AAA82888.1); 
At4CL2, A. thaliana 4CL2 (AF1 06086_1 ); At4CL3, A. thaliana 
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4CL3 (AF106088_1); ScCCL, S. coelicolor A3 (2) CCL 
(CAB95894); Cp4CL, C. pinensis 4CL (YP_0031 21 574.1). 
Two conserved peptide boxes from 4CL are boxed in red and 
nine residues tliat are suggested to form the At4CL substrate 
binding poctcet are marl<ed by red triangles (Kanel<o etal., 
2003). The alignment was performed using Geneious 6.1.7 
ClustalW default settings. Coloured residues indicate a simi- 
larity > 75%. 

Fig. S7. HPLC-MS analysis of enzyme assays with FIxY. 

A. Mass spectra of CoA (I), 4-coumaroyl-CoA standard (II) 
and products detected in enzyme assays containing FIxY and 
4-coumaric acid (III) or £-cinnamic acid (IV) as substrates. 

B. Theoretical fragmentation tree explaining the above mass 
spectra with randomly positioned charges. 

Fig. S8. HPLC-MS analysis of the substrate specificity of 
FIxY. The structures of the substrates, the EIC from CoA 
(green) and the EIC of expected substrate-CoA thioesters 
(blue) are shown. 

Fig. S9. Results of enzyme assays with FIxY. pH (A) and 
temperature optimum (B) of the coupled enzyme assay. Sub- 
strate saturation plots of FIxY with 4-coumaric acid (C) and 
E-cinnamic acid (D) and their respective linearization as 
Lineweaver-Burk (E, F) or Hanes-Wooif plots (G, H). 
Fig. S10. KOH test, mass spectra and HPLC traces from 

C. pinensis flexirubin 1. A. Crude acetone extract of 
C. pinensis culture (left) showing the reversible colour 
shift after addition of base (middle) or base and acid 
(right). B. HPLC-MS-chromatograms of C. pinensis extract 
before (upper trace) and after fractionation by column 



chromatography (lower trace). UV at 420 nm (blue line) and 
an EIC m/z 633.5 [M - H+] are shown. The occurrence of an 
additional signal after purification may be explained by 
isomerization. Chromatograms are drawn to the same scale. 
C. HPLC-ESI-MS" of 1 m/z 633.5 [M - H+]. MALDI-iontrap- 
MS^ spectra of m/z 315.2 (D) and m/z 265.3 (E) with m/z 
634.4 [M]+ as precursor. F. On the left side, MALDI-iontrap- 
MS" of wild-type 1 m/z 634.4 [M]+ (I) and m/z 637.4 [M]+ from 
feeding experiments with ds-methionine (II) and d4-tyrosine 
(III). The resulting fragments m/z 343.2 for wild type and m/z 
346.2 or m/z 346.3 for the feeding experiments were further 
fragmented and the corresponding MS^ mass spectra are 
depicted on the right side. 

Table SI. Strains and plasmids used In this work. 
Table S2. Primers and PCR products used in this study. 
Table S3. High-resolution MALDI-orbitrap-MS of CoA- 
thioesters from FIxY substrate assays. 
Table S4. Predicted gene clusters for flexirubin biosynthesis 
in C. pinensis. Domain-guided annotation Is based on 
conserved domains detected by BLAST-P of C. pinensis 
DSM 2588 primary sequences against the genome of 
F. jolinsoniae UW101 . 

Table S5. Gene cluster for flexirubin biosynthesis in 
F. jolinsoniae UW101 . Domain-guided annotation is based on 
conserved domains detected by BLAST-P. 
Table S6. Comparison of the enzymatic properties of bacte- 
rial 4CLs (4-coumarate-CoA ligase) and CCLs (cinnamate- 
CoA ligase). Values were obtained from assays with 
4-coumarate (4CA) and £-cinnamate (CA) as substrates. 
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